Abstract: Ophthalmic diseases pose a significant global health challenge, yet traditional diagnosis methods and existing single-eye deep learning approaches often fail to account for binocular pathological correlations. To address this, we propose DMS-Net, a dual-modal multi-scale Siamese network for binocular fundus image classification. Our framework leverages weight-shared Siamese ResNet-152 backbones to extract deep semantic features from paired fundus images. To tackle challenges such as lesion boundary ambiguity and scattered pathological distributions, we introduce a Multi-Scale Context-Aware Module (MSCAM) that integrates adaptive pooling and attention mechanisms for multi-resolution feature aggregation. Additionally, a Dual-Modal Feature Fusion (DMFF) module enhances cross-modal interaction through spatial-semantic recalibration and bidirectional attention, effectively combining global context and local edge features. Evaluated on the ODIR-5K dataset, DMS-Net achieves state-of-the-art performance with 80.5% accuracy, 86.1% recall, and 83.8% Cohen's kappa, demonstrating superior capability in detecting symmetric pathologies and advancing clinical decision-making for ocular diseases.
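The core Siamese idea above, running both eyes' images through one weight-shared backbone before fusing the features, can be sketched in a few lines. This is a toy illustration only: the paper uses ResNet-152 backbones plus the MSCAM and DMFF modules, whereas here a single shared linear-tanh map stands in for the backbone and plain concatenation stands in for fusion.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 8))  # ONE shared weight matrix used by both branches

def backbone(x):
    """Shared feature extractor applied to either eye's input features."""
    return np.tanh(x @ W)

def siamese_features(left_eye, right_eye):
    """Run both inputs through the SAME backbone, then concatenate."""
    f_left = backbone(left_eye)
    f_right = backbone(right_eye)
    return np.concatenate([f_left, f_right], axis=-1)

left = rng.normal(size=(4,))
right = rng.normal(size=(4,))
fused = siamese_features(left, right)
```

Because the weights are shared, identical left and right inputs produce identical feature halves, which is what lets the network compare the two eyes in a consistent feature space.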
Abstract: With the rapid development of social media, a tremendous number of videos with new classes is generated daily, raising an urgent demand for video classification methods that can continuously incorporate new classes while maintaining knowledge of old videos under limited storage and computing resources. In this paper, we summarize this task as \textit{Class-Incremental Video Classification (CIVC)} and propose a novel framework to address it. As a subarea of incremental learning tasks, the challenge of \textit{catastrophic forgetting} is unavoidable in CIVC. To better alleviate it, we exploit characteristics specific to videos. First, we decompose the spatio-temporal knowledge before distillation rather than treating it as a whole in the knowledge transfer process; trajectory information is also used to refine the decomposition. Second, we propose a dual-granularity exemplar selection method that selects and stores representative video instances of old classes, and key-frames inside those videos, under a tight storage budget. We benchmark our method and previous SOTA class-incremental learning methods on the Something-Something V2 and Kinetics datasets, and our method outperforms previous methods significantly.
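Budgeted exemplar selection for old classes can be sketched with a herding-style heuristic: keep the instances whose features lie closest to the class mean. This is an assumption-laden stand-in, not the paper's method; the dual-granularity approach described above additionally selects key-frames inside each stored video, which this sketch omits.

```python
import numpy as np

def select_exemplars(features, budget):
    """features: (n, d) array for one old class.

    Returns indices of the `budget` instances nearest the class mean,
    a common herding-style proxy for 'representative' examples.
    """
    mean = features.mean(axis=0)
    dists = np.linalg.norm(features - mean, axis=1)
    return np.argsort(dists)[:budget]

rng = np.random.default_rng(1)
feats = rng.normal(size=(10, 5))   # toy per-video feature vectors
kept = select_exemplars(feats, budget=3)
```

The storage budget is enforced directly by `budget`; a real system would trade off the number of stored videos against the number of key-frames kept per video.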
Abstract: We study the learning of a matching model for dialogue response selection. Motivated by the recent finding that random negatives are often too trivial to train a reliable model, we propose a hierarchical curriculum learning (HCL) framework that consists of two complementary curricula: (1) a corpus-level curriculum (CC); and (2) an instance-level curriculum (IC). In CC, the model gradually increases its ability to find the matching clues between the dialogue context and response. On the other hand, IC progressively strengthens the model's ability to identify the mismatched information between the dialogue context and response. Empirical studies on two benchmark datasets with three state-of-the-art matching models demonstrate that the proposed HCL significantly improves model performance across various evaluation metrics.
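The general curriculum mechanism behind frameworks like this, training on easier examples first and gradually admitting harder ones, can be sketched with a simple linear pacing function. The difficulty scores, the linear schedule, and the function names here are illustrative assumptions; HCL's actual corpus-level and instance-level schedules are defined in the paper, not reproduced here.

```python
def curriculum_subset(examples, difficulties, epoch, total_epochs):
    """Return the easiest fraction of examples, growing linearly with epoch."""
    order = sorted(range(len(examples)), key=lambda i: difficulties[i])
    frac = min(1.0, (epoch + 1) / total_epochs)   # linear pacing schedule
    k = max(1, int(frac * len(examples)))
    return [examples[i] for i in order[:k]]

data = ["a", "b", "c", "d"]
diff = [0.9, 0.1, 0.5, 0.3]   # toy difficulty scores (higher = harder)
early = curriculum_subset(data, diff, epoch=0, total_epochs=4)  # easiest only
late = curriculum_subset(data, diff, epoch=3, total_epochs=4)   # everything
```

Early epochs see only the easiest instances; by the final epoch the full training set is in play.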
Abstract: Response selection plays a vital role in building retrieval-based conversation systems. Recent works on enhancing response selection mainly focus on inventing new neural architectures for better modeling the relation between dialogue context and response candidates. In almost all these previous works, binary-labeled training data are assumed: every response candidate is either positive (relevant) or negative (irrelevant). We propose to automatically build training data with grayscale labels. To make full use of the grayscale training data, we propose a multi-level ranking strategy. Experimental results on two benchmark datasets show that our new training strategy significantly improves performance over existing state-of-the-art matching models in terms of various evaluation metrics.
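One natural way to exploit grayscale labels is a multi-level pairwise ranking objective: a candidate at a higher relevance level should out-score every candidate at a lower level by a margin. The hinge formulation, the margin value, and the names below are assumptions for illustration, not the paper's exact loss.

```python
def multilevel_ranking_loss(scores_by_level, margin=0.2):
    """scores_by_level: candidate scores ordered from most to least relevant
    (e.g. [positive, grayscale, negative]).

    Sums a pairwise hinge penalty whenever a higher level fails to beat
    a lower level by at least `margin`.
    """
    loss = 0.0
    for hi in range(len(scores_by_level)):
        for lo in range(hi + 1, len(scores_by_level)):
            gap = scores_by_level[hi] - scores_by_level[lo]
            loss += max(0.0, margin - gap)
    return loss

# Well-ordered scores with comfortable gaps incur zero loss;
# flat scores are penalized once per violated pair.
ordered_loss = multilevel_ranking_loss([0.9, 0.5, 0.1])
flat_loss = multilevel_ranking_loss([0.5, 0.5, 0.5])
```

With binary labels only the positive/negative pair exists; grayscale labels add the intermediate comparisons, giving the model a finer-grained ranking signal.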